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Abstract 

Bacterial phosphothreonine lyases catalyze a novel posttranslational modification involving formation of dehydrobutyrine/dehyroa- 
lanine by p elimination of the phosphate group of phosphothreonine or phosphoserine residues in their substrate proteins. Though 
there is experimental evidence for presence of dehydro amino acids in human proteins, no eukaryotic homologs of these lyases have 
been identified as of today. A comprehensive genome-wide search for identifying phosphothreonine lyase homologs in eukaryotes 
was carried out. Our fold-based search revealed structural and catalytic site similarity between bacterial phosphothreonine lyases and 
BLES03 (basophilic leukemia-expressed protein 03), a human protein with unknown function. Ligand induced conformational 
changes similar to bacterial phosphothreonine lyases, and movement of crucial arginines in the loop region to the catalytic pocket 
upon binding of phosphothreonine-containing peptides was seen during docking and molecular dynamics studies. Genome-wide 
search for BLES03 homologs using sensitive profile-based methods revealed their presence not only in eukaryotic classes such as 
chordata and fungi but also in bacterial and archaebacterial classes. The synteny of these archaebacterial BLES03-like proteins was 
remarkably similar to that of type IV lantibiotic synthetases which harbor LanL-like phosphothreonine lyase domains. Hence, context- 
based analysis reinforced our earlier sequence/structure-based prediction of phosphothreonine lyase catalytic function for BLES03. 
Our in silico analysis has revealed that BLES03-like proteins with previously unknown function are novel eukaryotic phosphothreonine 
lyases involved in biosynthesis of dehydro amino acids, whereas their bacterial and archaebacterial counterparts might be involved in 
biosynthesis of natural products similar to lantibiotics. 

Key words: eliminylation, phosphothreonine lyase, fold-based function annotation, divergent evolution, genome context 
analysis, lanthionine synthetase. 



Introduction 

Posttranslational modification (PTM) of proteins typically 
involves covalent attachment of a chemical moiety to side 
chains of amino acids in proteins. Eliminylation is a recently 
characterized PTM which irreversibly removes the phosphate 
group of phosphothreonine (pT) or phophoserine (pS) residues 
in phosphorylated proteins, thereby converting them to 
dehydrobutyrine (Dhb) or dehydroalanine (Dha), respectively 
(fig. 1). Eliminylation reaction is catalyzed by a newly discov- 
ered family of enzymes, namely Phosphothreonine lyases 
(PTLs). Unlike phosphatases which restore the hydroxyl 
moieties of Thr/Ser by removing the phosphate group, PTLs 
remove the phosphate group using p-elimination reaction 
involving Cp-OP bond. Removal of phosphate group by 
P-elimination results in the formation of Ca = cp double 
bond (Li et al. 2007) and the hydroxyl group of Thr/Ser is 
not restored. In absence of the hydroxyl group eliminylated 



Thr/Ser cannot be phosphorylated again, thus the modifica- 
tion by eliminylation is irreversible. 

Eliminylating enzymes were recently discovered in three 
different pathogenic bacteria. It was shown that Shigella 
OspF (Li et al. 2007), Pseudomonas HopAH (Zhang et al. 
2007) and Salmonella SpvC (Mazurkiewicz et al. 2008) 
dephosphorylate pT of pT-X-pY motif in dual phosphorylated, 
fully active host MAPKs (mitogen-activated protein kinases). 
This inactivates MAPKs in the host, hindering the initiation of 
innate immune response and thus increasing the chances of 
survival of these pathogens. Structural and biochemical anal- 
yses have provided novel insight into the catalytic mechanisms 
of PTLs (Zhu et al. 2007; Chen et al. 2008). Crystal structures 
of SpvC-like PTL from different organisms, such as Salmonella 
typhimurium [Protein Data Bank (PDB): 2Z8N, 2Z8M, 2Z80, 
2Z8P] (Chen et al. 2008), Chromobacterium violaceum [PDB: 
3B06], Sa. enterica [PDB: 2P1 W, 2Q8Y] (Zhu et al. 2007), and 
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Fig. 1. — Schematic representation of eliminylation reaction. 



Shigella flexneri [PDB: 3I0U] have shown that it has a novel 
ot/p-fold. The binding mode of SpvC-like PTL to its substrate 
was revealed by crystal structures [PDB: 2Q8Y and 2Z8P], 
where substrate peptide was cocrystallized along with SpvC- 
like PTL (Chen et al. 2008). The substrate-bound structure 
showed that pY of pT-X-pY motif was essential for substrate 
recognition and the positively charged pocket that binds pY is 
distinct from the pocket that binds and eliminylates pT (Zhu 
et al. 2007; Chen et al. 2008). 

More recently, PTL catalytic activity has also been discovered 
in type IV lantibiotic synthetases involved in biosynthesis of 
lantibiotics in bacteria (Goto et al. 2010, 2011). Lantibiotics 
are posttranslationally modified bioactive peptides containing 
characteristic lanthionine (Lan) or methyllanthionine (MeLan) 
groups. Lantibiotics are synthesized by four classes of lantibio- 
tic synthetases (Goto et al. 2010). VenL-like multifunctional 
enzymes, belonging to type IV lantibiotic synthetases (LanL), 
harbor three distinct functional domains and catalyze biosyn- 
thesis of lanthipeptides using Ser/Thr- and Cys-rich ribosomal 
polypeptides as substrates. Interestingly, bioinformatics analy- 
sis of LanL-like multidomain enzymes suggested the presence 



of Ser/Thr kinase-like domain, C-terminal cyclase domain, and 
a novel N-terminal domain that shared weak homology with 
SpvC-like PTLs. The catalytic residues of PTL (Goto et al. 2010, 
2011) were conserved in this N-terminal domain, thus 
suggesting putative lyase activity. Together these results 
suggested the possible role of LanL in biosynthesis of lanthi- 
peptides using a new mechanism. Subsequent experimental 
studies confirmed PTL activity in the N-terminal domain of 
VenL and led to the discovery of novel type IV biosynthetic 
pathway for lanthipeptides. The kinase domain of LanL-like 
proteins first phosphorylates a Ser/Thr residue on Ser-, Thr-, 
and Cys-rich precursor peptide and this is followed by eliminy- 
lation of phosphate group by lyase domain converting pT or pS 
to Dhb or Dha, respectively. C-terminal cyclization domain 
subsequently catalyzes formation of Lan or MeLan moieties 
by covalent attachment of Dha or Dhb residues to cystines. 

Discovery of type IV lantibiotic synthetases has established 
that PTL domains are not only present in bacterial virulence 
factors but also in bacterial biosynthetic pathways. However, 
straightforward sequence similarity searches indicate that 
both these families of lyases are restricted to bacteria only. 
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Brennan et al. have hypothesized that eliminylating enzymes 
might be involved in eukaryotic signaling (Brennan and Barford 
2009). Even though until now there are no reports of PTL 
homologs in eukaryotes, crystallin in aging human lenses 
was found to contain Dha (Srivastava et al. 2004) and the 
crosslink formed by Dha and lysine/cysteine is also present in 
human lenses (Linetsky et al. 2004). However, enzymes which 
catalyze formation of these dehydro amino acids in crystallin 
are not known. In view of the presence of large number of 
protein kinases in human genome, it is tempting to speculate 
that formation of these dehydro amino acids might be cata- 
lyzed by type IV lantibiotic-like pathways involving kinase and 
phosphothreoine lyase activity. Interestingly, LanC-like pro- 
teins are also present in humans and certain other eukaryotes, 
though their function is yet unknown (Brennan and Barford 
2009). It is possible that LanC-like enzymatic domains act on 
Dha/Dhb and crosslink them to cysteine or lysine residues. 
These evidences encouraged us to systematically search for 
putative PTL-like enzymes in humans and other eukaryotes. 

In this work, we have carried out a systematic genome-wide 
search for PTL enzymes in humans and other eukaryotes. 
Apart from sensitive profile-based searches, using the crystal 
structures of bacterial pathogenic PTLs, we have also carried 
out search for proteins which can potentially adopt PTL fold. 
Structure-based analysis revealed that BLES03 (basophilic leu- 
kemia-expressed protein 03), a human protein with unknown 
function, has PTL fold and it also shares catalytic residues of 
known PTLs. Our analysis also revealed presence of BLES03- 
like proteins in archaea. The resemblance of the genomic 
neighborhoods of archaeal BLES03-like enzymes to other cat- 
alytic components of LanL pathway also suggests PTL-like 
function for BLES03 homologs in archaea. Hence, a completely 
independent line of evidence based on synteny also suggests 
PTL activity for BLES03. Interestingly, our genome-wide search 
has led to the discovery of putative PTL-like proteins in all three 
branches of life, that is, bacteria, archaea, and eukaryotes. We 
also consider the evolutionary relationship between these fam- 
ilies, whose sequences are so diverged that even most sensitive 
sequence-based method did not find any relation, yet their 
overall three-dimensional fold and active site pocket have 
been conserved to perform same function. Association of 
BLES03 with many different cancer types and known role of 
bacterial PTLs in disruption of MAPK signaling also suggest that 
our in silico prediction of PTL activity for BLES03 might provide 
clues for deciphering molecular basis of the disease association 
of BLES03. 

Materials and Methods 

Sequence- and Structure-Based Search for New 
Eliminylating Domains 

In order to identify novel eliminylating domains in various gen- 
omes, the experimentally characterized eliminylating domains 



[Swiss-Prot: P0A2M9, F2R8I9 and Q9H3H3] were used as 
query and sequence similarity searches were carried out in 
nonredundant (nr) database (released in September 2012) of 
National Center for Biotechnology Information (NCBI). Same 
searches were also carried out on current version of nr data- 
base using the web version of BLAST tool. Though the number 
of hits has increased, overall taxonomical distribution of the 
families remained same. For VenL [Swiss-Prot: F2R8I9], 
sequence of the fragment containing only the PTL domain 
(amino acid range 1-170) was used as query. Pairwise se- 
quence similarity searches were carried out using a local 
version of NCBI BLASTp (Altschul et al. 1997; Johnson et al. 
2008) and PSI-BLAST program was used for iterative profile- 
based searches. The E value cutoff of 10~ 3 was used for both 
BLASTp and PSI-BLAST. First round of PSI-BLAST uses BLAST to 
search for proteins similar to query in a given database. For 
subsequent rounds, it builds a profile from the hits obtained in 
previous round and queries the target using this profile. The 
sequence of events was repeated till no new sequences could 
be added to the profile, that is, till convergence. Profile-profile 
comparisons were done using HHPred server (Soding et al. 
2005). In addition to comparison of profiles built using two 
sequences, it also compares their secondary structures. To 
assess the significance of results, HHPred reports E value and 
probability. E value does not incorporate secondary structure 
score, whereas probability contains contribution from second- 
ary structure score. PDB70 and PfamA databases of HHPred 
were used as target for remote homology detection. PDB70 
database contains profiles built using representative sequences 
present in PDB database. The sequences are chosen such that 
no two sequences have identity above 70%. PfamA database 
is derived using multiple sequence alignment of all PfamA 
families from Pfam database. Multiple sequence alignment 
(MSA) for each family was built using ClustalW. For phyloge- 
netic analysis, bootstrapped trees were built and visualized 
using QuickTree (Howe et al. 2002) tool of PHYLIP package 
and iTOL, respectively. The phylogenetic trees can be accessed 
using the URL http://itol.embl.de/shared/shradha (last accessed 
July 30, 2014). Also the Newick files of all three trees have 
been included in supplementary files S3-S5, Supplementary 
Material online, for detailed analysis. The leaves of phyloge- 
netic trees are colored based on the taxonomical classes and 
the text on branches indicates bootstrap values using 1,000 
replicates. Aligning sequences from the distantly related four 
families was complicated. In order to increase the quality of 
alignment for such highly divergent proteins, structural infor- 
mation has been used by PROMALS-3D software to guide the 
MSA (Pei et al. 2008). As no crystal structures were available 
for LanL-like proteins, representative sequences of LanL-like 
proteins VenL and RamC were given as input along with crys- 
tal structures of BLES03 and SpvC-like PTL to obtain the MSA 
(fig. 5). 

DaliLite program from Dali server (Holm and Rosenstrom 
2010) was used to search for the structural neighbors of the 
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eliminylating domains in PDB by using the structure 2Q8Y of 
known eliminylating domains as query. DALI Z score above 
2 indicates statistically significant structural similarity. 
Superimposition, analysis, and visualization of the structures 
were carried out using academic version of PyMOL software 
The PyMOL Molecular Graphics System, Version 0.99 
Schrodinger, LLC. PROMALS3D (Pei et al. 2008) software 
was used for structure-based MSA of representative 
sequences and structures of known and predicted eliminylat- 
ing domains. PROMALS3D has advantage over conventional 
MSA software because it uses structural information and 
predicted secondary structure to guide the MSA of distantly 
related homolog. PROMALS3D employs PSIPRED (McGuffin 
et al. 2000) algorithm to predict secondary structure for 
those proteins for which three-dimensional structures are 
not available. In addition to PROMALS3D, secondary structure 
predictions were also carried out for VenL-like proteins using 
JPRED server (Cole et al. 2008). The results from JPRED were 
compared with consensus secondary structure predicted by 
PROMALS3D and available crystal structures of SpvC-like 
PTL, BLES03, and elF4E. Schematic diagrams of pT- and phos- 
photyrosine-binding pocket interactions were generated using 
LigPlot+ (Laskowski and Swindells 201 1). 

Modeling of BLES03-Peptide Complex 

In order to model the substrate peptide in complex with 
BLES03, flexible docking approach was used and the peptide 
fragment QYFMpTE from ERK (extracellular-signal regulated 
kinases) was docked in the catalytic site of BLES03 using 
HADDOCK server (de Vries et al. 2010). During the docking 
process, apart from the substrate peptide ligand, the loop 
region on the BLES03 receptor which contained R215 and 
R1 69 was also made flexible. HADDOCK outputs top ten clus- 
ters ranked by HADDOCK score, which includes various energy 
terms. As all the top ten complexes had ERK peptide docked in 
the active site pocket predicted by us, the cluster where pT was 
at a distance of 6 A or less from K1 28, K204, K1 59, and Y1 75 
was chosen. The conformation with minimum energy in the 
chosen cluster was considered for further analysis. 

Molecular Dynamics Simulations on BLES03-Peptide 
Complex 

It may be noted that HADDOCK permits only a limited 
number of residues to remain flexible during docking. To in- 
corporate complete flexibilities in both ligand and the recep- 
tor, explicit solvent molecular dynamics (MD) simulations were 
carried out on the complex obtained from HADDOCK. The 
peptide bound BLES03 was solvated in a rectangular box of 
TIP3 water molecules (Jorgensen et al. 1983) such that box 
boundaries were at a distance of 10 A from the outermost 
atoms of the BLES03-peptide complex along x, y, and z direc- 
tions. MD simulations were carried out using AMBER1 1 pack- 
age (Case et al. 2010) and ff03 force field (Duan et al. 2003). 



Force field parameter assigned for the pT residue in the sub- 
strate peptide was based on earlier work by Homeyer et al. 
(2006). The initial structure of the solvated BLES03-peptide 
complex minimized using steepest descent approach to a 
RMS gradient of 0.001 kcal/mol/A using SANDER module. 
After minimization, MD runs were carried out by SANDER 
module using a time step of 1 fs and bonds involving hydro- 
gen atoms were constrained using SHAKE (Ryckaert et al. 
1977). Long-range electrostatic interactions were computed 
using Partile Mesh Ewald (Darden et al. 1993) approach and a 
cutoff of 10 A was used in the direct space for nonbonded 
interactions. Before starting the production dynamics, a two- 
stage equilibration process was carried out. In the first stage, 
simulations were carried out in NVT ensemble and the tem- 
perature of the system was gradually raised from 1 00 to 300 K 
over 5 ps of simulation time. The temperature of the system 
was constrained to the specified value by using Langevin 
dynamics temperature coupling scheme and collision fre- 
quency of 5ps~ 1 . After the temperature of the system 
reached 300 K, the second stage of equilibration was carried 
for 1 5 ps using NPT ensemble to adjust the density of water to 
1 g/cc. After the system had attained a temperature of 300 K 
and density of 1 g/cc, production run was carried out for 30 ns 
in NPT ensemble using PMEMD (Partile Mesh Ewald Molecular 
Dynamics) module of AMBER. 

As the distance between R169 and pT219 was approxi- 
mately 14 A, the cutoff was increased to 16 A for a short 
period between 17 and 21 ns of simulation time. Indeed, this 
was the period in which R169 came closest to pT219. The dis- 
tance between R1 69 and pT2 1 9 in various conformations sam- 
pled during the MD simulations was calculated using ptraj 
module of AMBER and other in-house PERL scripts. In order 
to analyze the solvent accessible surface area of pT, coordinates 
were extracted from the trajectory at an interval of 100 ps in 
PDB format using ptraj module and provided as input to the 
NACCESS program (Hubbard and Thornton 1993). 

Synteny of BLES03 and LanL-Like Proteins 

Contextual information was extracted from fully sequenced 
prokaryotic genomes using in-house Perl scripts. Five genes 
from upstream and five from downstream were picked. 
When completely sequenced genomes were not available 
Gene database from NCBI was queried. In addition to records 
from fully sequenced genomes, intermediate genomic records 
from unfinished genomic data are also included in Gene data- 
base. Functional information about neighboring genes was 
extracted using Pfam database (Finn et al. 2014). 

Results 

Sequence-Based Search for PTLs 

Pairwise BLASTp (Johnson et al. 2008) and profile-based 
PSI-BLAST (Altschul et al. 1 997) searches using SpvC sequence 
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as query yielded a total of 53 and 55 proteins, respectively 
(supplementary file S1, Supplementary Material online). All 
these proteins showed a sequence similarity in the range of 
41-100% to SpvC with query coverage above 34%. 
Phylogenetic distribution of PTL sequences obtained by 
BLAST and PSI-BLAST searches was restricted to pathogenic 
bacterial species (supplementary table S1 in supplementary file 
S2, Supplementary Material online) (fig. 2/4). In contrast to 
SpvC-like PTLs, PSI-BLAST search for sequences similar to 
lyase domains of VenL-like proteins resulted in 301 hits (sup- 
plementary table S1 in supplementary file S2 and supplemen- 
tary file S1, Supplementary Material online). These 301 
proteins showed a sequence similarity in the range of 
41-100% to PTL domain of VenL with query coverage 
above 24%. The distribution of these VenL-like PTL domains 
was also restricted to bacterial families (fig. 26). Although 
VenL-like proteins are present in diverse bacterial classes 
such as actinobacteria, proteobacteria, bacilli, sphingobac- 
teria, and mollicutes, the distribution of SpvC-like PTLs is re- 
stricted to alpha, beta and gamma classes of proteobacteria. 
There was no overlap between sequence neighbors of SpvC- 
and VenL-like PTL domains, indicating high sequence diver- 
gence between these families despite similarities in catalytic 
activity. Structure-based profile-profile comparison tool, 
HHPred (Soding et al. 2005), was used to search the Pfam 
database (Punta et al. 2012) for protein families that are sim- 
ilar to SpvC- and VenL-like PTLs. Interestingly with both SpvC 
and VenL as query, top hit was SpvC-like PTL family [Pfam ID: 
PF03536] (supplementary table S24 in supplementary file S2, 
Supplementary Material online). Similarly, when HHPred 
search was carried out in PDB70 database (Soding et al. 
2005) using SpvC- and VenL-like PTL domains as query, the 
crystal structures of SpvC-like PTLs were obtained as top hits 
(supplementary table S26 in supplementary file S2, 
Supplementary Material online). Though an evolutionary link 
could be established between SpvC- and LanL-like PTLs, 
BLASTp, PSI-BLAST and HHPred could not detect any new 
families of PTL. 

Structure-Based Search for Remote Homologs Reveals 
PTLs Function for BLES03 

It is well known that there are several enzyme families, which 
show a conserved structural fold despite high degree of 
sequence divergence (Furnham etal. 2012). Hence, structural 
similarity searches were carried out in PDB using Dali Server 
(Holm and Rosenstrom 201 0) to identify sequences that adopt 
unique a/p-fold of PTLs (Ke et al. 201 1). Table 1 shows results 
of Dali search ranked by Zscore when the PTL structure 2Q8Y 
was used as a query. Apart from other known PTL structures, 
eukaryotic translation initiation factor 4E (elF4-E) [PDB: 2IDR] 
and human basophilic leukemia-expressed protein (BLES03) 
[PDB: 2Q4K] have RMSD values less then 4 A and Z score 
above 5, indicating statistically significant structural similarity 



with PTL. Although structural similarity between BLES03 and 
elF4-E has been reported by SCOP (Andreeva et al. 2008), 
their similarity to PTLs was not reported earlier. Figure 3 
shows the structural fold and topology diagram for represen- 
tative structures belonging to PTL, elF4-E, and BLES03 families. 
As can be seen from figure 3A, the core region consisting of 
an array of seven beta strands and two helices is conserved in 
all three different structures in terms of their relative orienta- 
tion and packing. However, in one-dimensional sequence the 
order in which these structurally equivalent segments are 
connected is different (fig. 36). Hence, sequence/profile- 
based methods such as PSI-BLAST or Pfam failed to detect 
any similarity between these three protein families. 
However, structure-based search could identify similarity be- 
tween SpvC-like PTL, elF4-E, and BLES03. It is interesting to 
note that among these three protein families which adopt a 
common fold, functions are known for PTL and elF4-E 
(Matsuo et al. 1997). The native function of elF4-E is to bind 
m7G cap at the 5'-end of mRNA and thereby help in recruiting 
ribosomes to mRNA (Sonenberg 2008). However, BLES03 is 
essentially a protein with unknown function. Therefore, to 
explore whether BLES03 shares catalytic residues with PTL or 
elF4-E we analyzed the putative active site pocket of BLES03. 

Structural and biochemical analyses have elucidated cata- 
lytic mechanism of SpvC-like PTLs (Zhu et al. 2007; Chen et al. 
2008; Ke et al. 201 1). The crystal structure of PTL from 5a. 
enterica in complex with the substrate peptide 
(QYFMpTEpYVA) from human ERK revealed that active site 
of PTL consists of a phosphotyrosine-binding pocket in addi- 
tion to the pT-binding pocket (fig. 4). The phosphate moiety 
of pY is located in a positively charged groove composed of 
K1 34, K1 60, and F1 00 (fig. 46) (Zhu et al. 2007). The binding 
of this phosphate helps in orienting the pT moiety of the sub- 
strate into the catalytic pocket of the enzyme (Li et al. 2007). 
The phosphate group of pT makes strong ionic interactions 
with side chains of K104, R148, R213, and R220 (fig. 46). 
These interactions lock pT in the catalytic pocket and also 
prevent its hydrolysis. K104, K136, H106, and Y158 are in- 
volved in catalysis. K1 36 abstracts an a-proton of pT creating a 
carbanion intermediate which is stabilized by K104 and Y1 58 
(Zhu et al. 2007; Chen et al. 2008; Ke et al. 2011). H106 
protonates oxygen atom of the leaving phosphate group 
and Y158 keeps K136 in a deprotonated state, which is es- 
sential for its role as a catalytic base (Zhu et al. 2007). Figure 5 
shows the conservation profile of these catalytic and binding 
pocket residues in the structure-guided multiple sequence 
alignment (MSA) for the four families, that is, SpvC-like PTL, 
lyase domain of LanL-like proteins, BLES03, and elF4-E. The 
comparison of secondary structure of the LanL-like proteins 
predicted by Jnet algorithm (Cole et al. 2008) and consensus 
secondary structure predicted by PROMALS3D revealed that 
they are highly likely to share the conserved structural fold of 
PTLs. As can be seen from figure 5, pY-binding residues were 
not conserved in LanL, BLES03, and elF4-E. Earlier studies 
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■ Gammaproteobacteria 

■ Betaproteobacteria 

■ Alpha proteobacteria 

□ Actinobacteriadae 

■ Coriobacteridae 

□ Bacilli 
Clostridia 

□ Deinococci 

□ Deltaproteobacteria 

■ Nitrospirates 

■ Sphingobacteria 

□ Mollicutes 

□ Nanohaloarchaea 



Fig. 2. — Phylogenetic tree of SpvC-like PTL and VenL-like lyase. Phylogenetic trees built using SpvC-like PTL (top) and VenL-like lyase (bottom). Color on 
outer ring indicates taxonomic classes of the organism to which the respective protein belongs. To improve clarity, clades containing proteins from same 
taxonomical classes were collapsed and labeled. The numbers in bracket indicate number of sequences contained in that clade. The numbers next to 
branches indicate number of time-associated branches clustered together in a bootstrap test with 1,000 replicates. 



which used sequence-based alignments could not find resi- 
dues corresponding to R213 and R220 in LanL (Goto et al. 
2010, 201 1). Structure-guided sequence alignment (Pei et al. 
2008) revealed that all seven active site residues were 



conserved (fig. 5) in LanL, consistent with experimentally ver- 
ified PTL activity of these enzymes in lanthipeptide biosyn- 
thetic pathways. elF4-E showed conservation of only three 
of seven pT-binding residues. Putative catalytic residues 
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Table 1 

Results of Structural Similarity Search Using DALI 



Query 


Hits 


DALI Z Score 


RMSD 


Nalign 


% Sequence 
Similarity 


Family 


2Q8Y 


2Z8P 


39.1 


0.4 


213 


100 


PTL 




3B06 


35.8 


1.1 


213 


80 


PU- 




3I0U 


30.7 


1.1 


191 


93 


PIL 




2IDR-A 


5.8 


3.6 


114 


11 


elF4E-1 




2Q4K-A 


5.7 


3.8 


122 


7 


C110RF68 




1ZTP-A 


5.6 


3.3 


120 


7 


C110RF68 




3NOG-B 


4.7 


4 


107 


6 


Acriflavin resistance protein 




3DQ0-A 


4.6 


3.8 


116 


9 


Cytokinin dehydrogenase 




Fig. 3— Structural similarity of elF4-E, BLES03, and SpvC-like PTLs. (A) Cartoon representation of elF4-E (left), BLES03 (center), and SpvC-like PTL (right). 
Catalytic residues in SpvC and the residues of elF4-E and BLES03 which align with catalytic residues of SpvC are shown in stick representation. Conserved 
core is shown in rainbow colors. Corresponding core secondary structures in ribbon and topology diagrams (8) have same colors. (6) Topology diagrams 
generated using PDBSUM. 



Genome Biol. Evol. 6(8):201 7-2033. doi:10.1093/gbe/evu161 Advance Access publication July 24, 2014 



2023 



Khater and Mohanty 



GBE 




2q4k_chamA.p001/l -228 
i ztp.ehainAj 0 OS/1 -228 
22tm_ch*inAj002/l -21 5 
2pl n_chainAj>004/l -203 

jbJiEA03262.1_:l -170.bifu/l-170 
gU11899S51 l-240.pmati/1.240 
lejl_(hainA.p007/l -178 
Jnet SS Pred. (VenL) 
Consensus SS (PROMALS3D) 



2q4k.chainA.p001 II -228 
1 Ztp.cbainAjOOS/1 -228 
2z8m_ch»inA_p002/l -21$ 
2pl rt.cttiinA.p004/l-203 

gb_A£A03262.1_:l -170_bifu/l-170 
gi.31 1 S99S51 :i -240_pulili/l -240 

ltjl_(hHinA_p007/l -178 

Jnet SS Pred (VenL) 
Consensus SS (PROMALS3D) 

2q4k_ch»inAj>Q01/l-228 
1 ztp.chainAj>OOS/l -228 
2z8m_chainA_pO02/l -215 
2pl H.chainAj 004/1-203 

gbJ^EA032S2.1.:l-!70.bifu/l-170 
gi.3t 1 899851 .1 -24 0_ptilMi/t -24 0 
1 tjl_chainA_p007/l -1 7S 

Jnet SS Pred (VenL) 
Consensus SS (PROMALS3D) 




★2 

LMH LAP- 
L 1-1 H LAP - 
FH I SVLR-- 
F H I S V L R - • 
L H L S AT A - ■ 
I HVS AVP- • 



. CF KLDHAWAG I ARAV 

■ GF KLDHAWAG I ARAV 

■ - DMVPQAFQALS G L L 

■ - DMVPQAFQALS C L L 
. - AS APAVL EKALGVl 
. - AEAAEV I E LVT AY C 



1 KRlGRWL IT LNKQQRRSDLDRFWLET LLCL 



VEG R LQV- - A 

VEG RLQV--A| 

FS E DSPVDK- 

F S E DSPVDK- 

LRE ESPF 

L E A G L PH- - - 

I G ES F DDY S DDVCG- - AV 



SPR 40 

SPR 40 

T DM 41 

T DM 41 

FARS LDQVSA43 
F LVSPRVHHL43 
NV 49 



★4 



★5* 



41 AKEGGRQ-V I CVl 

41 AKEGGRQ-V I CVl 

42 - EKVVQQARVS LGAQ FT L| 
42 - EKVVQQARVS LGAQ F T L[ 
44 LNSRATPLGSSGK-F ITVl 
44 NNGKYARRGSSGK- LMT l\ 
SO RAKGD- K IAIWTTE CEN- 



DD FTD R LGVLEADSAI RAAGI 74 

DD FTD RLGVLEADSAI RAAGI -- 74 

I KPDQENSQYS ASF LHKTRQF I ECLESRLSEN91 

IKPDQENSQYS ASF LHKTRQF I ECLESRLSEN91 

PRS DAGAAR LAHELHGATA- - 80 

PAD DER LRDALDGLGPLLA- - 80 

RDAVT H I GRVYKERLG- - 81 



7S KCL 

75 KCL 

92 GV I S GQCP ES DVHP ENWKY 

92 GV 1 SGQCP ES DVHP ENWKY LS 

81 GLAGPRI LSDQPY AVNS LVH| 

81 GRRAPYVLGDLRWQD-GPiY 

82 IPPKIV I C 



*7 

KPDVYTY LG I y RANRWH- LC 
KPDVYTY LG IYRAN RWH - (, C 

RN E LRS G- -RDGGEM QR 

RNELR .......QR 

RY - - GS F VGRRR LS DDG L LV 
LRY--GGF LERWCT LESGERV 
|QS HAOT A-T KS KNR 



99 
99 
12 
12 
1! 
11 
10 



Fig. 5. — Structure-based sequence alignment provides a link between BLES03, SpvC, and LanL lyase. Structure-based sequence alignment of BLES03 
structures (highlighted in blue), SpvC-like PTL structures (in pink), structure of elF4-E (in yellow), and sequences of LanL-like proteins (in green). Red star: active 
site residues of SpvC-like PTL; blue circles: phosphotyrosine (pY)-binding residue. Consensus secondary structure and predicted secondary structure of VenL's 
lyase domain are shown below the alignment. Numbered stars provide a link to same residues in figure 6A 



corresponding to K136 and H106 were also not conserved. 
This explains lack of PTL activity of elF4-E despite sharing a 
structural fold similar to PTLs. Interestingly, BLES03 had two of 
the three catalytic residues absolutely conserved. The residues 



corresponding to K136 and Y158 of PTL corresponded to 
K159 and Y175, respectively, in BLES03. A closer look at the 
superimposed structures of PTL and BLES03 revealed that 
amino group of K204 (BLES03) was closest to the leaving 
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oxygen atom (fig. 6/4) and K204 could play the role of H 106 of 
PTL. Several earlier studies have demonstrated that Lys can 
indeed protonate leaving groups (Stein and Geiger 2002; 
Castro et al. 2009; Ganguly et al. 2009). It has also been 
shown that upon H106K mutation, PTL retains partial activity 
(5- to 7-fold decreased K cat value) in transferring a proton to 
the leaving group (Zhu et al. 2007) and its K m value is com- 
parable to the wild-type enzyme. The residue corresponding 
to K204 in SpvC-like PTL is R213 which is a weaker proton 
donor compared with lysine. Out of the four amino acids 
K104, R148, R213 and R220 that comprise pT-binding resi- 
dues in SpvC-like PTL, residues equivalent to K104 and R213 
were also conserved in BLES03 (i.e., K128 and K204, respec- 
tively). As discussed later, structural counterparts of pT-bind- 
ing residues, R148 and R220, were also found to be present in 
BLES03. Hence, structural fold-based comparison indentified 
novel PTL which even the most sensitive sequence-based 
method could not detect. 

MD Simulations Reveal Ligand-lnduced Conformational 
Changes in BLES03 Similar to PTLs 

The fold level structural comparisons and analysis of active site 
residues suggested that BLES03 has two out of the three cat- 
alytic residues and two out of the four pT-binding pocket 
residues conserved. Conformational changes upon binding 
of substrate have been observed in SpvC which brings R220 
and R1 48 into the catalytic pocket (Chen et al. 2008) (fig. 6fi). 
This prompted us to search for residues on loop regions of 
BLES03 that could be involved in recognition of pT and help in 
its PTL enzymatic activity. Based on structural superposition of 
BLES03 and PTL crystal structures, R215 and R169 were 
identified as probable residues in BLES03 that could flip into 
the catalytic pocket once substrate is bound. Interestingly, 
B-factor of loop containing R215 and R169 was higher than 
rest of the protein (fig. 60- In order to test this hypothesis, the 
pT-containing peptide fragment QYFMpTE from ERK, native 
substrate of SpvC-like PTL, was docked onto BLES03 structure 
using HADDOCK web server (de Vries et al. 2010). Top ten 
high scoring clusters from HADDOCK web server showed that 
the ERK peptide always docked into the predicted catalytic 
pocket (supplementary fig. S16 in supplementary file S2, 
Supplementary Material online). In majority of these clusters, 
R215 also showed a movement toward the catalytic pocket 
(fig. 6D and supplementary fig. S14 in supplementary file S2, 
Supplementary Material online). However, R169 did not show 
any significant movement. 

Multiple explicit solvent MD simulations of 30 ns duration 
were performed on BLES03-peptide complexes to investigate 
if incorporation of complete flexibilities can bring R169 closer 
to the pT. MD simulations revealed that ERK peptide 
was stable in the putative catalytic pocket of BLES03 and 
R215 that had flipped into the pocket during docking, re- 
mained in the catalytic pocket for the entire 30 ns of the 



MD trajectory. The variation of the shortest distance between 
the side chain N atoms of R160 and oxygen atoms of the 
phosphate group of pT over the 30 ns trajectory (fig. 7) 
revealed that at several time points the distance reduced to 
7.8 A from an initial distance of 1 3. 1 A. It is possible that R1 69 
moves away from pT because of the electrostatic repulsion 
between R169 and R215. Thus, MD simulations reveal that 
upon substrate binding the loop region of BLES03 can adopt a 
conformation where R169 is in vicinity of catalytic pocket 
(fig. 7), but due to its higher energy the conformation is not 
stable. Also, its movement toward catalytic pocket involves 
rearrangement of secondary structure, which is usually diffi- 
cult to see during simulations spanning over nanoseconds 
only. The strong interactions made by positively charged 
pocket in SpvC locks the ERK peptide in its position and also 
shield it from solvent for efficient catalysis (Zhu et al. 2007). 
The average exposed polar surface for pT was 4.43 A 2 , thus 
indicating that it is shielded from solvent, even when interac- 
tion from R169 is missing. Thus, docking and MD revealed 
substrate induced conformational changes in BLES03 similar 
to that of SpvC-like PTL. Molecular dynamic simulations also 
showed that the pT-containing peptide is stable inside BLES03 
pocket, which supports our hypothesis of BLES03 having PTL 
activity. 

Genome Mining for BLES03 Homolog Reveals 
Conservation of Putative PTL Catalyzing Residues 

In order to search for BLES03 homologs from all the king- 
doms of life, PSI-BLAST search in nr database was carried 
out using BLES03 as query. Figure 8 shows phylogenetic 
tree built using hits obtained from PSI-BLAST search. 
Interestingly, BLES03-like proteins were present in all the 
three kingdoms of life, that is, bacteria, archaea and eu- 
karyotes (fig. 8A; supplementary file S1, Supplementary 
Material online). Out of the 167 statistically significant 
hits obtained from the PSI-BLAST search, a majority of 
BLES03 homologs (142) were from eukaryota, 19 homo- 
logs were from archaea, and 6 hits corresponded to bac- 
teria. As can be seen from table 2, all the active site 
residues of PTL, except R215, are indeed conserved in 
BLES03 homologs from all three kingdoms of life (fig. 
86). Such high degree of conservation among distant ho- 
molog provides additional support for our prediction of PTL 
catalytic function for BLES03. BLES03-like proteins are pre- 
sent in eukaryotes such as chordata as well as dikarya. In 
addition to eukaryotes, these proteins are also present in 
archaea such as thermoprotei, halobacteria and methano- 
microbia and bacteria classes such as betaproteobacteria 
and actinobacteridae. Thus, our genome mining analysis 
reveals that BLES03-like proteins could be phylogenetically 
distinct class of PTLs which are present in all three king- 
doms of life, unlike SpvC- and LanL-like PTLs which are 
present exclusively in bacteria (fig. 9). 
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Fig. 6. — Comparison of the active site pockets of SpvC and BLES03. (A) Active site of SpvC-like PTL (pink) and putative active site of BLES03 (blue). 
Numbered stars provide a link to corresponding residues in the multiple sequence alignment shown in figure 5. pT from MAPK peptide is shown in green. (6) 
In SpvC, conformational changes bring R220 and R148 into the active site pocket on substrate binding. SpvC (orange), SpvC with substrate (purple). (Q 1ZTP 
(BLES03) colored according to its B-factor (red: low, white: medium, green: high B-factor). Loops containing R169 and R21 5 have medium and high B-factor, 
respectively, making it likely that they can move into the interior on substrate binding. (D) Truncated ERK peptide (QYFMpTE) was docked on to BLES03 
structure 2Q4K. Blue: residues before docking, light green: residues after docking, dark green: docked pT. Dashed lines show distance between interacting 
atoms. Conformational changes similar to SpvC are seen in BLES03 on docking — loop containing R21 5 moved into the catalytic pocket. 



Genomic Neighborhood Search Also Suggests PTL 
Function for Archaeal BLES03 Homolog 

In prokaryotic genomes, it is often found that functionally 
related genes occur in close proximity to each other on the 
genome or are fused together (Overbeek et al. 1999). 



Genomic context analysis has also helped in functional anno- 
tation of proteins of unknown function and detection of novel 
pathways (de Souza and Aravind 2012). Therefore, genome 
neighborhood of BLES03-like proteins was analyzed and 
compared with function inferred from our sequence and 
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Fig. 7. — Variation of the distance between R169 and pT219 during MD simulation. The distances have been computed between side chain nitrogen of 
R169 and oxygen of pT219. The inset shows snapshot taken at the starting of simulation, when distance R169 and pT219 is quite high, that is, 12.58 A and 
at point when distance between R1 69 and pT219 was closest, that is, 7.76 A. The time point along the trajectory when the snapshots were taken is marked 
with arrow symbol. 



structure-based analysis. Functional annotation of five 
upstream and five downstream genomic neighbors of pro- 
karyotic BLES03-like proteins was analyzed using Pfam data- 
base (Finn et al. 2014). Genomic neighbors of BLES03-like 
proteins could be obtained only for 1 6 cases where complete 
or partial genome sequences were available (supplementary 
table S4 in supplementary file S2, Supplementary Material 
online). It is interesting to note that synteny of 13 archaeal 
BLES03-like proteins was conserved in terms of presence of 
other functional domains (fig. 1 0A) and was very similar to 
lantibiotic gene clusters especially, class IV lantibiotic synthe- 
tase (fig. 1 06). The neighborhood of archaeal BLES03 is usually 
rich in kinases, hydrolases, peptidase, transport proteins, and 
at least one small protein whose C-terminal was rich in C, T or 
S. Similar small proteins rich in C-terminal C, T or S residues 
found in lantibiotic synthesis pathways are precursors for 
lanthipeptides. They undergo various PTMs including eliminy- 
lation to transform into an active lanthipeptides (fig. 106). 
Peptidases, also present in the genomic neighborhood of 
LanL, cleave the N-terminal signal sequence generating a 
mature lanthipeptide which is then transported out of the 
cell by transport proteins. 

Genes encoding two-component regulatory proteins, atyp- 
ical kinase Rio1 fused with peptidase, methyltransferase, and 
acetyltransferase were also found in the genomic neighbor- 
hood of archaeal BLES03-like proteins. Analogous proteins are 



present in the genomic neighborhood of lantibiotic synthe- 
tases (Altena et al. 2000). To coordinate lantibiotic production 
with other cellular events, lantibiotic gene clusters are under 
regulation of two-component systems. A number of modifi- 
cations other than dehydration and cyclization are also 
performed by different tailoring enzymes present in these 
gene clusters. Examples of these tailoring enzymes are 
oxidative decarboxylase, dehydrogenase, and hydroxylase 
(Willey and van der Donk 2007). Also several methyltrans- 
ferases and acetyltransf erases are found in class I lantibiotic 
gene clusters (Marsh et al. 2010). Thus, our genomic neigh- 
borhood analysis revealed that archaeal BLES03 and kinases in 
its synteny are functional counterparts of lyase domain 
containing LanL-like proteins in bacteria. These archaeal 
BLES03-like proteins might be catalyzing PTL activity 
and could be involved in synthesis of lanthipeptide-like 
natural products. These gene clusters could be a new 
class of lantibiotic synthetase clusters, which are yet to be 
explored. 

Analysis of context dependent features provides additional 
and independent evidence in support of our prediction of 
PTL activity for BLES03. These evidences suggest that 
these three classes of proteins have a common ancestor but 
have evolved through sequence divergence (supplementary 
fig. S4 in supplementary file S2, Supplementary Material 
online) to occupy all three kingdoms of life. 



Genome Biol. Evol. 6(8):201 7-2033. doi:10.1093/gbe/evu161 Advance Access publication July 24, 2014 



2027 



Khater and Mohanty 



GBE 




B 



* s S 



5> 



Chordata 

Hemi chordata 

Ecdysozoa 

Dikarya 

Bacillariophyta 

Chlorophyta 

Halobacteria 

Methane-bacteria 

Thermoprotei 

Actinobacteriadae 

B eta p rote o ba cte ri a 

Ktedonobacteria 



*1 *2 *3 

gif20B4317£9$/18B- G KVYLMH L A P GF K L D HAWA G I A RAW- EGQLQV AKVS 

Ul jji'1J949223S'/205-GKWMMF I T AD- RVDKYWEAV ATQTV-SQK LODA AKVA 

gt}ie9ei 303B//229- G KWM LFPK LE - EV NQVWK K V V E GV I -NNRLGPT AKVA 

gl(126 HOTSO'/nS- a KWL I FVAAO- E U 0 E AWQ K V R D ET V- K GT L Q I S AKVS 

gtl124485473IJJ2l- GKWl IFVGPE-EVDAAWQKVRDAT I - EGRLG I G AKVS 

GKWCSFVSPT-DVDEEWSKISDAIE-NMQLM-C AKVS 

GKWLLFVAEK-Y I □ DTWR NVKKAVE — DGK LWKO AKVS 



. gitS23S3023TI/44- 
gi\33$4B3270\!BB- 



'6423-4 ' H22 34' 35464-4 7$ 500- 



GKWL I FVAP+-EVDEAWQKV+DA+V-+G+ L G I AKVS- 



*4 « 

- RQV I CVVT DDFTDRLGVLEAOSA 

RPBL I A I VT KDVADRODVKRVLER 

- DHL I C I VTRML I EPKDFRDQDDVLRVLLE 
-RSVIYVYT RDWADEADVMRV RE R 

- RK V I Y V FT ADWADEPDVMRV RAE 

-GHV I CVYT R DWG □ GO DVMRAREV 

-VYVVCVYT YDCDDESDVMK I RGH 

JL .LjL 

-14 ' 6 - 2 73 3 25 677331 

RBVICVYT + DWADEQDVMRVRE 



M 

l L 1 ¥ K P 
P t YYKC 
S I YYKS- 

- I G Y K R 

- GVKR 
ELGYKR 
-ASVKS 



-DVYTYLGI VRA — NR 259 

-DAYT YLR I LSD- - NP 300 

-DAFTYLDLYKQTASE 331 

-NIETYKOEYSE 206 

-NLDTYAGEYRE 214 

-DIDTHNRIYGS--DE 13© 

- DEQT [ AG I YSD 15fl 




T Y L G I Y S+ - 



Fig. 8. — Phylogenetic tree and conservation of putative active site residues of BLES03 homologs. (A) Phylogenetic tree of BLES03 homolog. Color on 
outer ring indicates taxonomic classes of the organism to which the respective protein belongs. To improve clarity, clades containing proteins from same 
taxonomical classes were collapsed and labeled. The numbers in bracket indicate number of sequences contained in that clade. The numbers next to 
branches indicate number of time-associated branches clustered together in a bootstrap test with 1 ,000 replicates. (6) Conservation of probable active site 
residues of BLES03-like proteins in representative sequences obtained from a sequence similarity search using BLAST. Star numbers are same as figures 4 and 
5A First three sequences are from eukaryotic organism, next two from archaeabacteria, and the last two from bacteria. For clarity few columns of the MSA 
were hidden, which is depicted by blue vertical lines. 



Independent Literature Evidences in Support of the Role 
of BLES03 as PTL 

Even though BLES03 is an acronym for basophilic leukemia 
expressed protein 03, as of today there is no information avail- 
able for the precise molecular function of this protein. In view 



of our bioinformatics prediction of PTL function for BLES03, 
we wanted to search for literature-based evidence in support 
of putative PTL activity of BLES03. Pathogenic SpvC-like 
lyases are known to affect Thr phosphorylation. Therefore, 
evidences supporting association of BLES03 with protein 
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phosphorylation pathways were searched. Though we could 
not find any published data which show overexpression of 
BLES03 in basophilic leukemia, it was interesting to note 
that in RBL-2H3.2 cell line (low releasing variant of Rat 
Basophilic Leukemia) Thr phosphorylation by Protein Kinase 
C 5 (PKC8) on y chain of FceR1 is delayed and the same variant 



Table 2 

Percentage Conservation of Putative Active Site Residues of BLES03- 
Like Proteins across All Genomes and from Archaea 



Residue No. 


% Conservation 
(BLAST Result) 


% Conservation 
(in Archaea) 


K128 


98 


100 


K159 


92 


81 


R169 


51 


45 


Y175 


96 


81 


K204 


99 


90 


R215 


50 





has defect in PKC activity (Bingham et al. 1994). Hence, it is 
tempting to speculate that Thr eliminylation activity of BLES03 
could be the reason for the defect in PKC activity. Also, anal- 
ysis of expression data on 60 different cancer cell lines (Ross 
et al. 2000) showed that BLES03 is expressed in many cancer 
cell lines and in seven such cases it is among top 5% of all 
genes ranked by expression levels. These evidences suggest 
that BLES03 has a cancer-associated expression. Our predic- 
tion of BLES03 as eukaryotic PTL could provide clues to design 
experiments for better functional characterization of BLES03- 
like proteins in multicellular eukaryotes. 

Apart from the involvement in regulation of signaling path- 
ways, in multicellular eukaryotes BLES03 could also function in 
metabolic pathways involving dehydro amino acids similar to 
LanL-like PTLs. Dhas can be produced from cysteine, seleno- 
cysteine, and phosphoserine (pS) residues in a protein and 
their presence in proteins from different human tissues has 
been hypothesized (Cooper et al. 201 1). Dha and Dhb pro- 
duced from pS and pT, respectively, have been identified in 




Fig. 9. — Phylogenetic distribution of eliminylating domains. Spv-C-like PTL (green), VenL-like proteins (red), and ESLES03-like proteins (blue). BLES03 
proteins are present in bacterial classes which also contain Spv-C-like PTL and VenL-like proteins. 
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Fig. 10. — Similar genomic neighborhood of LanL-like proteins and BLES03-like protein in archaea. Synteny of LanL-like proteins (A) and BLES03-like 
protein in archaea (6). Each protein is shown as arrows and the source organism is listed on right side. Colors on the arrows have been decoded at the 
bottom. These annotations have been made only for proteins important in lantibiotic synthesis. BLES03 protein is represented by blue arrows, whereas LanL- 
like proteins are represented by a combination of red and blue arrow. Cyclization domains from LanL-like protein have not been represented for sake of 
clarity. 
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aged and cataractogenous human lenses (Linetsky et al. 
2004), though mechanistic details of their biosynthesis are 
unknown. Also, it is known that dehydro residues undergo 
irreversible glutathionylation by addition of GSH to form 
lanthionines (Skolnick et al. 2000; Bhattacharyya et al. 2002; 
Punta et al. 2012). LanCU, human homolog of prokaryotic 
lanthionine synthase-C (LanC), was demonstrated to be a 
novel glutathione-binding protein (Chung et al. 2007). Also, 
active site residues of LanC are conserved in LanCU. Armed 
with these findings Chung et al. have hypothesized that 
LanCLI might have an activity analogous to bacterial LanC, 
wherein LanCLI is involved in formation thioether linkages 
using GSH as one substrate. The second substrate of 
LanCLI for formation of thioether linkages using GSH might 
be Dha/Dhb. In view of these literature evidence, it is possible 
that BLES03 might act upstream of LanCLI converting pho- 
phothreonine/serine to Dha/Dhb, respectively (supplementary 
fig. SSA in supplementary file S2, Supplementary Material 
online). These dehydro residues might then be acted upon 
by LanCLI to convert them to a glutathione adduct. 
A recent study has demonstrated that LanCLI binds and in- 
hibits cystathione p-synthase (CBS) (Krishnadev and Srinivasan 
2011). It was also found that GSH/GSSG ratio in cells con- 
trolled LanCLI -CBS binding and hence its activity, though 
the mechanism could not be established. Based on our bioin- 
formatics analysis, one can hypothesize that BLES03 might act 
on a Thr/Ser residue of CBS converting it to Dhb/Dha which 
can then undergo LanCLI catalyzed Michael addition with 
GSH to form glutathione-CBS adduct (supplementary fig. 
S5B in supplementary file S2, Supplementary Material 
online). This unusual PTM might inhibit CBS. Thus, literature 
evidences suggest presence of Dha/Dhb, lanthionines, and 
LanC-like enzymes, which are key components of lanthipep- 
tide biosynthetic pathways in multicellular eukaryotes. Our 
structure- and genomic context-based bioinformatics analyses 
suggest that the missing link of eukaryotic PTL could be played 
by BLES03 and biosynthetic pathways similar to lantipeptide 
biosynthetic pathways could indeed be present in multicellular 
eukaryotes including humans. 

Discussion 

Eliminylation is a novel PTM catalyzed by newly discovered 
enzyme family called PTLs. PTLs were first discovered in path- 
ogenic bacteria such as Sh. flexneri, Pseudomonas syringae 
and Sa. enterica and subsequently, in lantibiotic producing 
bacteria such as Streptomyces venezuelae and Lactococcus 
lactis subsp. lactis. Even though there is no obvious sequence 
similarity between SpvC-like PTLs and LanL-like PTLs, their 
catalytic mechanisms are strikingly similar. They work in 
close coordination with Ser/Thr kinases to posttranslationally 
modify Ser or Thr residues to Dha or Dhb, respectively. As 
several experimental studies have identified dehydro amino 
acids such as Dha/Dhb in eukaryotes including humans, it 



has been contemplated that PTLs might also be present in 
eukaryotes. However, even sensitive profile-based sequence 
comparisons have failed to detect any functional analogs of 
PTLs in eukaryotes. As protein structure is evolutionarily more 
conserved than sequence, in cases where conventional 
sequence-based homology searches fail to identify putative 
homologs, comparison of fold level structural similarity in com- 
bination with chemical similarity in the active site has helped in 
identification of proteins with similar catalytic functions 
(Skolnick et al. 2000; Bhattacharyya et al. 2002). Inspired by 
these reports, we carried out structure-based genome-wide 
search for eukaryotic PTL domains. 

Our fold level structural similarity search and comparison of 
similarities in putative active site pockets helped in detection of 
eukaryotic protein BLES03, a human protein with unknown 
function, as functional counterpart of PTLs. Structural similar- 
ity between BLES03 and elF4-E has been reported earlier (Bitto 
et al. 2005; Andreeva et al. 2008) but essential catalytic res- 
idues of elF4-E were found missing in BLES03. However, 
based on the presence of a basic surface patch having elec- 
trostatic surface potential similar to elF4-E, Bitto et al. (2005) 
have hypothesized that BLES03 might be involved in nucleic 
acid binding. In this work, we show that this basic patch of 
BLES03 contains key residues required for stable binding of 
pT-containing peptide. Similar to SpvC-like PTLs, upon sub- 
strate binding there is a conformational change in the loop 
region which facilitates movement of crucial arginines into the 
catalytic pocket and assembly of complete pocket for catalyz- 
ing eliminylation reaction. Though BLES03 shares a higher 
degree of overall structural similarity with elF4E, a more 
detailed active site comparison involving docking and MD 
simulations suggests BLES03 to be functionally analogous to 
PTLs (supplementary fig. S2 in supplementary file S2, 
Supplementary Material online). Interestingly, the crystal struc- 
ture of BLES03 shows the putative pT-binding pocket to be 
interacting with N-terminal helix of symmetry-related mono- 
mer (supplementary fig. S3 in supplementary file S2, 
Supplementary Material online). It is possible that this N- 
terminal helix plays a regulatory role by occluding the active 
site of BLES03 under normal conditions and making it catalyt- 
ically inactive. In disease conditions, the movement of this N- 
terminal helix makes the catalytic site accessible and BLES03 
functions as a PTL. Similar regulatory mechanism has been 
demonstrated in case of the human protein HYPE which 
shares structural and functional similarity with AMPylating 
enzymes found in pathogenic bacteria (Engel et al. 2012). 

In addition to overall fold and catalytic site similarity, we 
also provide several other lines of evidence to conclude an 
evolutionary link between SpvC- and LanL-like PTL and 
BLES03. A search for homologs of BLES03 across all kingdoms 
of life confirmed its presence in eukaryotes, bacteria, and 
archaebacteria. The catalytic and binding pocket residues 
identified by our structure-based search were also found 
to be evolutionarily conserved. The analysis of genomic 
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neighborhood of BLES03-like protein in archaea showed strik- 
ing similarity with class III and IV lantibiotic synthetase gene 
clusters. The similarity in synteny of archaeal homolog of 
BLES03 and LanL-like PTL is independent evidence which 
suggests PTL function for BLES03. Our genome-wide search 
has also identified lantipeptide clusters where kinase and PTL 
are present as standalone proteins unlike class III and IV lanti- 
biotic synthetases, where they occur as fused domains on a 
single polypeptide. Therefore, combining sequence analysis 
and structural comparison with classical phylogenetic tree 
analysis we have shed light on the evolutionary history of 
eliminylating domains. 

In summary, the structure-guided in silico genome mining 
study has revealed that BLES03-like proteins are putative PTLs 
and are involved in biosynthesis of dehydro amino acids in 
eukaryotes and lanthipeptide secondary metabolites in 
archaebacteria. The computational predictions presented in 
this work would provide valuable inputs to design experiments 
for identification of eukaryotic PTLs and discovery of new 
lanthipeptide gene clusters. 

Supplementary Material 

Supplementary files S1-S5 are available at Genome Biology 
and Evolution online (http://www.gbe. oxfordjournals.org/). 
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